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Abstract 

In this short article, we study factor colorings of aperiodic linearly 
recurrent infinite words. We show that there always exists a coloring 
which does not admit a monochromatic factorization of the word into 
factors of increasing lengths. 
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1 Introduction 

Consider an infinite word x over a finite alphabet A, and let c be a function 
from the hnite factors of x to some hnite set of colors. We can view it as 
a coloring of 2-subsets of the natural numbers, where the color of {i,j} for 
i < j is the color of the factor x\ij-i]. The inhnite version of Ramsey’s 
theorem states that there is an inhnite c-monochromatic subset of N. If we 
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enumerate this subset as {zq, Zi, Z 2 ,...}, this means that x has a factorization 
X = ''' where every factor except the hrst has the 

same color. 

Now, a natural question arises: can we get rid of the offending hrst factor? 
That is, when can we guarantee that a complete monochromatic factorization 
exists? This question was raised in [B], and the hrst general treatment was 
given in [2], where it was proved for several important classes of inhnite 
words that monochromatic factorizations need not exist. In fact, the authors 
conjectured that all aperiodic words have this property. 

One well known class of words not considered in the study is that of hxed 
points of primitive substitutions. We consider a strictly larger class of words, 
the linearly recurrent ones, which have been studied in |31|S], among others. 
In this short article, we show that every aperiodic linearly recurrent word 
admits a factor coloring with no monochromatic monotone factorization, 
that is, one where the lengths of the factors are increasing. Note that the 
almost monochromatic factorization given by Ramsey’s theorem can be made 
monotone, since any two adjacent factors (except the hrst) can be merged 
together to produce a longer factor with the same color. 

2 Definitions and Preliminary Results 

In this article, the set of integers includes zero: N = {0,1,2,...}. The 
indexing of hnite and inhnite words also starts at 0. We denote by A‘^ the 
right-inhnite words over an alphabet A, while A*, A'^, A^ and A-'^ denote 
words of any hnite length, length at least 1, length exactly n and length at 
most n, respectively. 

Let X G A*UA‘^ be a hnite or right-inhnite word, and let Fact’''x, Prefix C 
A~^ be the sets of nonempty factors and nonempty prehxes of x, respectively. 
A factor coloring of x is a. function c : Fact^x C, where C is a hnite set 
of colors. A factorization of x is a (hnite if x G A*) sequence {Ui)i of words 
in Fact’'"x such that x = where the product denotes concatenation, 

and it is monotone if \Ui\ < \Uj\ for all z < j. Another factorization (Vj)j is 
coarser than {Ui)i if for all j there exists z such that \Vq- ■ ■Vj\ = \Uq- ■ - Uf. 
The factorization (f/j)j is c-monochromatic if c(f/j) = c{Uj) holds for all i,j, 
and strongly c-monochromatic if all factorizations coarser than it are also 
c-monochromatic. 

Definition 1. For h, fc > 1, let V{h, k) be the class of inhnite words x G A‘^ 
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such that there exists a factor coloring c : Fact’''x —?• C with \C\ < k such 
that no prehx of x has a c-monochromatic factorization of length h. We also 
dehne the classes Vjn{h,k), which only considers monotone factorizations, 
and Vs{h, k), which only considers strongly c-monochromatic factorizations. 

We denote V{k) = V{oo, k) and V = ljfc>i analogously for the 

other classes. 

Observe that we trivially have V{h,k) C Vra{h,k) C Vs{h,k) for all 
h,k > 1. Furthermore, V{h^k) C V{h',k') holds whenever h < h' and 
k < k', and similarly for the other two classes. Of course, the degenerate 
cases h = 1 and k = 1 give rise to empty classes. 

Example 2. Let x G be any inhnite word and u G any nonempty 
hnite word. We show that u^x ^ Vjn{h,k) holds for all h,k > 1. Namely, 
if c : Fact’'"M^x —)■ O is any factor coloring of u^x, then u) (with 

h copies of u) is a monotone monochromatic factorization of length h of a 
prehx of x. 

An analogous but somewhat weaker result holds for the classes Ps(h, k). 
Let FT G N be such that every F-colored complete graph of size larger 
than H contains a monochromatic induced subgraph of size h + 1] its ex¬ 
istence follows from the hnite version of Ramsey’s theorem. We claim that 
y = u^x ^ Vs{h,k), and for that, let c : Fact"*"?/ —>■ (F be a factor coloring 
with \C\ < k. Let G be the complete graph whose vertices are the inte¬ 
gers V = {0, |m|, ..., H\u\}, and let c' be the edge coloring of G given by 
= c{.yii,j-i]) for all i < j ^V. By the dehnition of H, there exists 
a monochromatic subset W C V of size h + 1; denote W = {no, ni,..., rih} 
with rii < Uj+i. Since c'{{i,j}) = \u\,j — |w|}) holds for all applicable i 

and j, we may assume uq = 0. But then {y[ni,ni+i-i])o<i<h is a size-h strongly 
monochromatic factorization of a prehx of y. 

We remark that we have not tried to obtain any bounds for the number 
H in the above example, except those given by Ramsey’s theorem. Since the 
edge coloring c' of G is of a special form (all edges of the same length have 
the same color), a more specialized approach would probably yield better 
bounds. 

The following result, which was already mentioned in the introduction, 
is the motivation behind the research direction started in |2]. It was not 
stated for strongly c-monochromatic partitions, but the proof readily gives 
this stronger result. 
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Proposition 3 (Proposition 3.1 in [2]). Let c : Fact ’'“a: C be a factor 
coloring of an infinite word x G . Then there exists i G M such that the 
infinite tail X[i^oo) G has a strongly c-monochromatic factorization. 

Proof. Let c' be a function from the 2-subsets of N to the set C defined 
by c'{{i,j}) = c(x[jj_i]) for all i < j. By the infinite version of Ramsey’s 
theorem, there exists an infinite c'-monochromatic set S' C N. Denoting 
S = {io, ii, * 2 , ■ • •} with ij < ij+i, this implies that every factor X[i.^i._^^_i] has 
the same c-image for all j G N and k > 1. But then the inhnite tail x\i^^ao) 
has the factorization which is strongly c-monochromatic, and 

we choose i = io- □ 

In [2], it was proved (among other results) that all words that are not 
uniformly recurrent are in V, all words with full factor complexity (Fact^x = 
A'^) are in V{2), and all Sturmian words are in V{3). Recall that Sturmian 
words are exactly the infinite words over the alphabet {0,1} that are balanced 
(the number of Is in any two factors of the same length differs by at most 
1). The authors conjectured that all aperiodic infinite words are in the class 
V, and by the results stated above, all still unsolved cases are uniformly 
recurrent. 

The parameter h and the classes Vm and Vs were not considered in [2], 
and some of their basic properties are still unknown. For example, it is clear 
that ljfe>i ^(^5 C V{oo, k) holds for all k > 1, and analogously for the 
other classes, but we do not know whether the inclusion is strict (except in 
the degenerate case k = 1). 

Let X G be an inhnite word. For u G Fact^x, a return to u (in x) is 
a word w G A'^ such that wu G Fact’'"x and u occurs exactly twice in wu. 
The set of returns to u in x is denoted TZxiu). We say that x is uniformly 
recurrent, if for every factor u G Fact’''x, the set Vx{u) is nonempty and hnite. 
Equivalently, every factor of x occurs inhnitely many times and with bounded 
gaps. If X is uniformly recurrent, then for each nonempty prehx u of x, we 
have a unique factorization x = rorir 2 ■ ■ ■ where rj G TZx{u) for all i G M. 
We hx some enumeration of TZx{u), which dehnes a bijection 0“ : Rx{u) —)■ 
Vx{u), where Rx{u) = {0,1,..., |7^a;(-u)| —1}. We extend 0“ into a morphism 
from Rx{u)* to A* by 0“(uoUi • ■ -nk-i) = 0“(’^o)0x(’^i)'' The 

following lemma can be found in [1]. 

Lemma 4 (Proposition 2.6 in |3]). Let x G A^ be a uniformly recurrent 
word, and let u,v E Pref’'"x with |u| < |m|. 
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1. The morphism 0“ : Rx{u)* —)■ A* is injective. 

2. We have TZx{u) C Q'^{Rx{v)*). 

3. There is a unique morphism A“ : Rxiu)* —)• Rxiv)* such that 0^ o A“ = 
0 “. 

A word X G is linearly recurrent (with constant K) if for all u G Fact’''x 
and all w G Tlxiu) we have |r(;| < K\u\. It is periodic if x = x\^n,oo) for some 
n > 1, and aperiodic otherwise. We need the following facts abont linearly 
recnrrent words, which can be fonnd in [3]. 

Lemma 5 (Theorem 24 in [3]). Let x G A^ be an aperiodic linearly recurrent 
word with constant K. 

1. For all u G A+ we have ^ Fact’'"x. 

2. For all u G Fact^x and w G Tlxiu) we have K\w\ > |m| > \w\/K 

3. For all u G Fact^x we have \TZxiu)\ < K{K + 1)^. 

Important classes of linearly recnrrent words inclnde certain Sturmian 
words, and all aperiodic fixed points of primitive substitutions [3]. Namely, 
a Stnrmian word is linearly recnrrent if and only if the continned fraction 
expansion of its asymptotic density of Is has bonnded coefficients (see P for 
details). Recall that a snbstitntion a : A* ^ A* is primitive if there exists 
n G M snch that all a G A occur in cr"'(6) for any b & A. 


3 Main Results 

In this section, we present our main result. 

Theorem 6. If x ^ is linearly recurrent with constant K and aperiodic, 
then X G Vm- More specifically, we have x G Vm{K + l,k) for k = 2 + 
+ Iff 

Proof. Denote = X[o,itn_i], the prehx of x of length K^, and let B = 
{0,..., K{K + 1)^ — 1}. We use B as a common index set for the return 
words of every prehx pn, since it contains Rxipn) for all n G N. Consider a 
word u G Fact^x. If u G IZx{Pn)"^ for some m G N, then Lemma 0] implies 
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that there exists a unique word r G such that u = We claim 

that |r| < mK^. This is because |tc| > \pn-i\/K = holds for every 

w G Tixipn-i) by Lemma m so that we have > mK'^~‘^\r\, while 

|m| < miL|pri| = We denote this r by r{u). Finally, denote by n{u) 

the unique number n G N such that < |w| < 

Using the notions dehned in the above paragraph, we dehne the following 
factor coloring c : Facf''x —)■ C of x, where C = {#, $} U (Z 2 x 

{ {n{u) mod 2, r(n)), if u G 'JZx{Pniu))-^\ 

$, if M ^ T^xiPniu))-^^ and u G Pref’''x, 

otherwise. 

Note that \C\ = 2 + 2K\K + l f\ 

We claim that no prehx of x admits a monotone and c-monochromatic 
factorization of length K + 1. Suppose on the contrary that such a factor¬ 
ization exists: UqUi ■ ■ ■ Uk G Pref’'"x where 0 < |f/i| < |[/j| for all i < j, and 
every Ui has the same c-image a E C . Since Uq G Pref’'"x, we have a 7 ^ 

In particular, every Ui is a prehx of x. Let n = n(Uo). Since Ui G Pref’'"x 
and |Ui| > |t/o|, the word pn is a prehx of Ui, and thus Uq G TZx{Pn)"^ for 
some m G M. We have |17o| < by the dehnition of n, and Lemma [5] 

implies |r(;| > K"‘~^ for all w G TZxiPn), so that m < K^. By the dehnition 
of c, we then have a = c{Uq) = (n mod 2, r(t/o)). 

The idea of the rest of the proof is the following. If every Ui has the same 
n-image, then they are equal by the injectivity of Qx{pn-i)i so that x has a 
{K -|- l)-fold repetition as a factor. If this is not the case, then the n-image 
of one of the words is at least 2 larger than that of f/o, so that it is at least 
K times longer than Uq. Since that word is also a prehx of x, it contains a 
{K -|- l)-fold repetition of Uq. In both cases, we obtain a contradiction with 
Lemma El 

We proceed with the proof. Suppose hrst that n(Uo) = n{Ui) for all 
i G {1, 2,..., iP}. In this case, we have 

Ui = e'—(r(f/,)) = e'—(r(f/„)) = f/„ (1) 

for all i, since r{Ui) = r{UQ), so that the words of the factorization are in 
fact equal. But then we have UqUi ■ ■ ■ Uk = U^~^^ G Fact^x, contradicting 
Lemma El 

Suppose then that there exists 1 < i < K such that n{Ui) > n([/o), and 
choose the minimal such i. By equation ([T]), we again have UqUi ■ ■ ■ Uj_i = 
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Uq. Since n{Ui) = niUo) mod 2, we actually have n{Ui) > n{Uo) + 2, which 
implies \Ui\ > K\Uo\. But since Ui is a prefix of x, the word Uq is a prefix of 
Ui- Then the word UqUi ■ ■ ■ f/j-if/g = f/g* is a prefix of x, and the longer one 
of t/g* and Ui is a prefix of the other. Iterating this argument, we see that 
Ui is a prefix of the infinite periodic word Uq , and since \Ui\ > K\Uo\, the 
word Uq is a prefix of Ui. Since we also have U^Ui G Pref’''a; and i > I, this 
implies U^^^ G Fact’'"a;, again a contradiction. □ 

Since every fixed point of a primitive substitution is linearly recurrent, 
we have the following corollary. 

Corollary 7. If X E is the fixed point of an aperiodic primitive substitu¬ 

tion, then X G Pm- 
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